correlation matrix

Terms from Artificial Intelligence: humans at the heart of algorithms

Page numbers are for draft copy at present; they will be replaced with correct numbers when final book is formatted. Chapter numbers are correct and will not change now.

A correlation matrix is formed when you have multiple features on a stream or set of data, for example multiple database records or time series. The correlation cooefficient is calculated for each pair of features and set in a matrix. The matrix is symmetric as the correlation cooefficient does not depend on the order of the features, and the diagonal is the correlation between a feature and itself.

The correlation matrix can be used as a means to visualise the relationships between features, and in particular if there are some groups of features that are all closely related with one anther. It can also be used as a pre-processing step before other analysis such as clustering of features or dimensional reduction.

Used on Chap. 7: page 134; Chap. 8: page 159